Learning PP attachment for filtering prosodic phrasing

نویسندگان

  • Olga van Herwijnen
  • Antal van den Bosch
  • Jacques M. B. Terken
  • Erwin Marsi
چکیده

We explore learning prepositionalphrase attachment in Dutch, to use it as a filter in prosodic phrasing. From a syntactic treebank of spoken Dutch we extract instances of the attachment of prepositional phrases to either a governing verb or noun. Using cross-validated parameter and feature selection, we train two learning algorithms, TB I and RIPPER, 011 making this distinction, based on unigram and bigram lexical features and a cooccurrence feature derived from WWW counts. We optimize the learning on noun attachment, since in a second stage we use the attachment decision for blocking the incorrect placement of phrase boundaries before prepositional phrases attached to the preceding noun. On noun attachment, IB 1 attains an F-score of 82; RIPPER an F-score of 78. When used as a filter for prosodic phrasing, using attachment decisions from IB 1 yields the best improvement on precision (by six points to 71) on phrase boundary placement.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Prosodic phrasing and attachment preferences.

The attachment of a relative clause (RC) has been found to differ across languages when its head noun is a complex NP. One attempt to explain the attachment differences is the Implicit Prosody Hypothesis (IPH) proposed by Fodor (1998, 2002). The goal of this paper is to show how the default phrasing of a sentence (explicit prosody), defined phonologically, differs across seven languages (Englis...

متن کامل

Default phrasing and attachment preference in Korean

This paper tests the validity of the Implicit Prosody Hypothesis (IPH) (Fodor 1998, 2002) based on production and perception experiments on Korean data. IPH states that attachment of a relative clause (RC) in a sentence with a complex noun phrase is influenced by a default prosodic contour of the structure projected in silent reading. It predicts that speakers of a language who prefer high atta...

متن کامل

Prosodic phrasing with inductive learning

Prosodic phrasing is an important component in modern TTS systems, which inserts natural and reasonable breaks into long utterance. This paper reports the study of applying several inductive machine-learning algorithms to prosodic phrasing in unrestricted Chinese texts. Two feature sets are carefully selected considering the effectiveness and reliability of them in practice. Then features and t...

متن کامل

A new prosodic phrasing model for indian language telugu

Prosodic phrasing is an important and more difficult a problem for Indian languages, as the Indian language scripts use very little or no punctuation. This paper reports a preliminary attempt on data-driven modeling of prosodic phrase boundary prediction for the Indian language Telugu. In an effort to identify meaningful features that affect the prosodic phrasing, a new feature, namely mopheme ...

متن کامل

A Preliminary Study of Prosodic Disambiguation by Chinese EFL Learners

This study investigated whether Chinese learners of English as a foreign language (EFL learners hereafter) could use prosodic cues to resolve syntactically ambiguous sentences in English. 8 sentences with 3 types of syntactic ambiguity were adopted. They were far/near PP attachment, left/right word attachment and wide/narrow scope. In the production experiment, 15 Chinese college students who p...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003